Where Does the Alignment Score Distribution Shape Come from?

Authors

  • Philippe Ortet
  • Olivier Bastien
Abstract

Alignment algorithms are powerful tools for searching for homologous proteins in databases, providing a score for each sequence present in the database. It has been well known for 20 years that the shape of the score distribution resembles an extreme value distribution. Because biologists encounter this class of distributions so often, the question arises of the evolutionary origin of this probability law. We investigated the possibility of deriving the main properties of sequence alignment score distributions from a basic evolutionary process: a duplication-divergence protein evolution process in a sequence space. First, the distribution of sequences in this space was defined with respect to the genetic distance between sequences. Second, we derived a basic relation between the genetic distance and the alignment score. We obtained a novel score probability distribution that is qualitatively very similar to that of Karlin-Altschul but performs better than all previous models.
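
For orientation, the classical Karlin-Altschul extreme value form that the abstract uses as its point of comparison can be sketched as follows. This is not the novel distribution derived in the paper, which the abstract does not specify. The function names are hypothetical, and the values of `lam` and `k` in the usage lines are illustrative placeholders rather than parameters taken from this work.

```python
import math


def karlin_altschul_evalue(score, m, n, lam, k):
    """Expected number of chance alignments scoring at least `score`.

    Classical form E = K * m * n * exp(-lambda * S), where m and n are
    the query and database lengths. `lam` and `k` depend on the scoring
    system and must be supplied by the caller (assumed inputs here).
    """
    return k * m * n * math.exp(-lam * score)


def karlin_altschul_pvalue(score, m, n, lam, k):
    """Probability of at least one chance alignment scoring >= `score`.

    P = 1 - exp(-E), the extreme value (Gumbel-type) tail.
    expm1 keeps precision when E is very small.
    """
    e_value = karlin_altschul_evalue(score, m, n, lam, k)
    return -math.expm1(-e_value)


if __name__ == "__main__":
    # Illustrative placeholder parameters, not values from the paper.
    lam, k = 0.267, 0.041
    m, n = 300, 1_000_000
    for s in (40, 60, 80, 100):
        print(s, karlin_altschul_evalue(s, m, n, lam, k),
              karlin_altschul_pvalue(s, m, n, lam, k))
```

When the expected count E is small, the P-value is approximately equal to E, which is why the two quantities are often used interchangeably for highly significant hits.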

Journal title:

Volume 6, Issue

Pages -

Publication date: 2010